首页> 外文OA文献 >The DBMS - your Big Data Sommelier
【2h】

The DBMS - your Big Data Sommelier

机译:DBMS-您的大数据侍酒师

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

When addressing the problem of "big" data volume, preparation costs are one of the key challenges: the high costs for loading, aggregating and indexing data leads to a long data-to-insight time. In addition to being a nuisance to the end-user, this latency prevents real-time analytics on "big" data. Fortunately, data often comes in semantic chunks such as files that contain data items that share some characteristics such as acquisition time or location. A data management system that exploits this trait can significantly lower the data preparation costs and the associated data-to-insight time by only investing in the preparation of the relevant chunks. In this paper, we develop such a system as an extension of an existing relational DBMS (MonetDB). To this end, we develop a query processing paradigm and data storage model that are partial-loading aware. The result is a system that can make a 1.2 TB dataset (consisting of 4000 chunks) ready for querying in less than 3 minutes on a single server-class machine while maintaining good query processing performance.
机译:在解决“大”数据量的问题时,准备成本是关键挑战之一:加载,聚合和索引数据的高成本导致较长的数据获取时间。除了对最终用户造成麻烦之外,这种延迟还会阻止对“大”数据进行实时分析。幸运的是,数据经常出现在语义块中,例如文件,其中包含共享某些特征(例如采集时间或位置)的数据项。利用此特征的数据管理系统仅投资相关块的准备工作就可以大大降低数据准备成本和相关的数据收集时间。在本文中,我们开发了这样的系统,作为现有关系DBMS(MonetDB)的扩展。为此,我们开发了支持部分加载的查询处理范例和数据存储模型。结果是,一个系统可以在不超过3分钟的时间内在一台服务器级计算机上准备好1.2 TB数据集(由4000个数据块组成)的查询,同时保持良好的查询处理性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号